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Multihour engineering is a technique for designing trunk networks 
when the hours of peak traffic loads between various pairs of offices 
do not coincide. A new descent-type computational algorithm for the 
multihour engineering problem is derived. This algorithm obtains the 
unique solution to the minimization of the multihour cost function, 
which is strictly convex but only piecewise differentiable. The nonin- 
teger minimum-cost solution is subsequently rounded to the nearest 
allowable integer solution to give a realizable network. The new algo- 
rithm is applied to three numerical examples from the California 
network. The results are compared with the nonoptimal, nonunique 
solutions obtained with an earlier algorithm, and with the traditional 
single-hour solutions. 

I. INTRODUCTION 

This paper describes a numerical algorithm which obtains the unique, 
optimal noninteger solution to the multihour traffic network engineering 
problem. This solution is subsequently rounded to the nearest allowable 
integer solution to yield a unique, near-optimal realizable network. 

As described in Ref. 1, multihour engineering is a procedure whereby 
a least-cost traffic network is engineered for more than one set of 
point-to-point loads, subject to the constraint that blocking on any 
last-choice trunk group not exceed a specified value. For networks which 
exhibit noncoincident traffic patterns,* the multihour engineering 
method has been shown to achieve significant capital-cost savings over 
the conventional single-hour engineering procedures. 1 

The results reported in Ref. 1 were based on an algorithm which op- 
timizes the high-usage trunk group sizes* one at a time, in a fixed but 

* Traffic loads between different pairs of offices are said to be noncoincident if their 
highest average values occur in different hours, or at the same hour but in different sea- 
sons. 

1 High-usage groups are direct groups which carry the majority of the load between those 
pairs of offices which have a large enough community of interest to warrant direct trunk- 
ing. 
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arbitrary sequence, until no further cost reductions can be obtained. This 
algorithm is called a "coordinate-search" algorithm here. Such an al- 
gorithm has the undesirable property that it does not generally converge 
to a unique solution of the multihour problem. It can converge to any 
one of a family of suboptimal solutions, depending on the initialization 
of the algorithm, and on the specific order in which the calculations are 
performed. 

The practical reasons for obtaining the optimal — and hence (as shown 
in Section III below) unique — solution to the multihour problem are as 
follows: First, the periodic re-engineering of the network in response to 
new load forecasts is facilitated. (In the Bell System, most networks must 
be re-engineered at least once each year.) A unique solution guarantees 
that changes in trunk-group sizes from one forecast period to the next 
reflect only changes in the loads. In contrast, the coordinate-search al- 
gorithm can produce changes in trunk-group sizes which are as much 
a function of nonuniqueness as they are of actual alterations in the loads. 
It is not possible to distinguish between these two effects, and thus use 
of the coordinate-search algorithm could lead to excessive rearrange- 
ments. Second, capital-cost savings with respect to the coordinate-search 
solutions can be realized in most cases. 

The essential difficulty of the multihour engineering problem arises 
from the fact that the network cost function is not differentiable ev- 
erywhere in its domain. The algorithm presented here, however, is as- 
sured of convergence to the minimum-cost noninteger solution by the 
convexity of the cost function and by the particular mechanism for ex- 
ecuting the search process. 

II. MULTIHOUR ENGINEERING— THE MODEL AND ITS COST FUNCTION 

The model of the network considered in this paper is shown in Fig. 1. 
Traffic which is destined from the single originating office to one of n 
terminating offices is first offered to the appropriate one-way high-usage 
trunk group. If all the trunks in that group are busy, the traffic overflows 
and is offered to a final group which routes this parcel to a tandem switch, 
from where it is sent to its destination via a tandem-completing group. 
The final and tandem-completing groups are sized so that the blocking 
probability on each is 0.01 during its respective busy hour. The object 
of the engineering process is to determine the high-usage trunk group 
sizes which minimize the cost of the network subject to the blocking 
constraints on the alternate routes. 

The cost of the network can be divided into four components, which 
are defined below: 

(0 The direct-route cost: It is assumed that the cost of each high-usage 
trunk group is directly proportional to the number of trunks in the 
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Fig. 1 — Network model. 



group. + If the cost per trunk of the ith high-usage group is Cdi, and there 
are x,- trunks in this group, then the total cost for high-usage trunks in 
this network is given by 



Cd = E CdiXi. 



(1) 



j=i 



In the theory which follows, as well as in the algorithm based on this 
theory, x,- is treated as a nonnegative real variable. 

(u) The final-route cost: Let ft = {1, 2, . . . Ji \ be the set of hours for 
which the network is to be engineered, and let h (E. ft. Let a,/, be the load, 
in erlangs, offered to the ith high-usage group in hour h. Then the ov- 
erflow from this group in hour h is given by 



a ih = a ih B(xi, aih) 



(2) 



where B(- , a,/,) is a strictly convex and continuously differentiable 
function of x, which agrees with the Erlang loss function on the integers. ' 
The overflow parcels from all the high-usage groups are combined and 
offered to the final trunk group. It is assumed that, in addition to the 
overflow traffic, the final group also has offered to it a first-routed load 
in hour h, designated by Afh- 



t Such an assumption is necessary, since the eventual realization of the network in terms 
of facilities is not known at the time that the groups are sized. Thus, average costs per trunk 
are used in approximating the eventual cost of each group. 

t Such an interpolating function can always be constructed since B{n — 1, a) - B(n, a) 
> B(n, a) - B(n + 1, a), n = 1, 2 2 
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In sizing high-usage trunk groups, it is customary to approximate the 
number of trunks required in the final group by dividing the total load 
offered to this group by its so-called "marginal capacity." 1>3t If the cost 
per trunk of the final group is c/ and its marginal capacity 7/, then the 
cost of sizing the final to carry only its hour-/i load is approximated 
by 

CfH-^Ufh+tau). (3) 

Since the final group must be engineered for its busiest hour, its ap- 
proximate cost is 

C f = max C fh . (4) 

h g n 

The actual sizing of the final group (which takes place only after all the 
high-usage groups have been sized) is done more precisely, of course. 

(Hi) The switching cost: It is assumed that the cost of switching is 
proportional to the load, with a unit-cost per erlang of c s . Let A s h denote 
the load switched by the tandem in hour h, excluding the overflows from 
the n high-usage groups. (The first-routed load on the final in hour h, 
A/h, is included in A a h-) Ignoring the blocking on the final group, the cost 
of switching only the hour-/i load is 



C s h = c s (A sh + E auA. 



(5) 



The cost of the tandem switch, when engineered for its busy hour, is 
then 

C s - max C s h- (6) 

h e » 

(iv) The tandem-completing costs: The total load offered to the ith 
tandem-completing group in hour h is A,/, + a,/,, where am is the overflow 
from the ith high-usage group (neglecting the blocking on the final group 
and at the tandem) and Ajh is the remaining load destined to the ith 
terminating office via the tandem. As in the case of the final group, the 
size of the tandem-completing group is not computed exactly, but rather 
approximated by dividing its offered load by its marginal capacity. Let 
y ti be the marginal capacity of the ith tandem-completing group, and 
c t i its cost per trunk. Then the cost of sizing this group to carry only its 
hour-/i load is 



t While the marginal-capacity assumption is not appropriate for determining actual 
trunk requirements on the final group, it is sufficiently accurate for the comparative 
purpose to which it is put here. 
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C tlh = — (A ih + aih ), (7) 

Iti 

and sizing this group for its busy hour results in a cost 

C ti = max Ctih- (8) 

h e ft 

The cost of providing trunks for all tandem-completing groups is thus 

C t = t C ti . (9) 

The total cost of the network is simply the sum of these four compo- 
nents: 

C(x) ~C d + C f + C s + C t 
= L CdiXi + -*- max ( A fh + £ <* ih ) 

+ c, max ( A sh + L <*ih) 
n Cti 

+ E — max (A ih + ctih), (10) 

i'= 1 7ti hG?t 

where x = \xi, . . . ,x n \ is the n-vector of high-usage group sizes. This 
function is called the "multihour cost function." Note that the final, 
switch, and tandem-completing costs may attain their maxima for dif- 
ferent values aih, since each of these alternate-route components may 
be busy in a different hour. 

The object of multihour engineering, then, is to minimize the cost 
function defined by eq. (10) with respect to the high-usage trunk group 
sizes, i.e., to determine x = x* such that 

C(x*) = min C(x) (11) 

where the set % is defined by 

% = \x:xi >0, i = l,...,n\- (12) 

From the point of view of the (continuous) multihour cost function, 
any x is "feasible" if x G %■ Of course, an actual network is realizable 
only in whole trunks (or, in the presence of modular engineering rules,"'' 
in terms of whole modules of trunks). Thus, the noninteger solution x* 



t The uncertainty in load forecasts and the inherent modularity of some facilities have 
recently led to the engineering and administration of some networks in modules of 
trunks. 
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is subsequently rounded to an integer (or modular) solution, as discussed 
in Section IV. 



III. MINIMIZATION OF THE MULTIHOUR COST FUNCTION 
3.1 A reformulation 

Equation (10) expresses the cost of the network as the sum of a linear 
term and n + 2 maxima of sets of nonlinear terms. For the analysis which 
follows, it is convenient to rewrite this cost function in terms of a single 
maximization operator. 

Let M be a vector-valued index set with elements n = (pi, jU2, . . . ,Mn+2). 
Each component of n, in turn, is a member of the set fi m {1, 2, . . . ,H], 
i.e., M = \n = 0*1, • • • ,Mn+2): fii E &1 Let |C„(x): nEM\bea new family 
of cost functions, called "elementary cost functions," which are defined 
by 

C„(x) ± C d (x) + C /mb+1 (*) + C -|1|1+2 U) + £ C« w (*). (13) 

i=l 

In this equation, C t ,- W (x) is the cost of sizing the tth tandem completing 
group for its hour-tii load, C/ Mn+1 (x) is the cost of sizing the final for its 
hour-jtin+i load, and C siln+2 (x) is the cost of sizing the switch for its 
hour-Mn+2 load, as defined by eqs. (7), (3), and (5), respectively; the 
functional dependence upon the trunk-group-size vector x is explicitly 
indicated. It follows from eq. (10) that the multihour cost function is 
obtained from eq. (13) by maximizing each term on the right-hand side 
with respect to the appropriate component of ft: 

C(x) = C d (x) + max C/ Mn+1 (x) 

Hn+l € "» 

n 

+ max C s „ n+2 (x) + L max C titli (x). (14) 

This term-by-term maximization, however, is equivalent to maximizing 
C^x) over all possible choices of n: 

C(x) = max C M (*). (15) 

In other words, the multihour cost function can be viewed as the point- 
wise maximum of the elementary cost functions defined by eq. (13). The 
multihour engineering problem now has the following form: Determine 
the vector x * E % with the property that 

C(x*) = min max C M («). (16) 

x G % M G M 
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Fig. 2 — Level curves of the multihour cost function. 

3.2 Some properties of the multihour cost function 

The multihour cost function has two fundamental properties on which 
the algorithm for finding its minimum is based: (i) it is a strictly convex 
function of trunk group sizes; (ii) it is only piecewise differentiable in 
these variables. Each member of the family of functions C^(x), \i £ M, 
is the sum of differentiable, strictly convex functions plus a linear term, 
and is therefore itself strictly convex and differentiable. 4 Since the 
multihour cost function is the pointwise maximum of a family of strictly 
convex functions, it is also strictly convex, 4 but not necessarily differ- 
entiable everywhere. In particular, if two or more elementary cost 
functions are maximal at some point (and hence their graphs intersect), 
the multihour cost function is generally not differentiable at that 
point. 

Figure 2 illustrates the possible nondifferentiability of the multihour 
cost function for an example with two high-usage groups, and in which 
only two distinct elementary cost functions [denoted simply by Ci(x) 
and C 2 (x)] are maximal anywhere. The dashed curve separates the two 
regions in the *i - x 2 plane in which Ci(x) > C 2 (x) and C 2 (x) > Ci(x), 
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respectively. The solid lines are the level curves of C(x), i.e., the loci of 
points for which C(x) = £, where £ is a constant. The location of the 
minimum, x*, is indicated by the circled point. Clearly, the multihour 
cost function is not differentiable anywhere along the dashed curve, 
where the graphs of the two elementary cost functions intersect. 

This simple example also illustrates why a coordinate-search algorithm 
may fail to converge to the minimal solution. Figure 3 shows the same 
level curves as Fig. 2, together with three typical paths which a coordi- 
nate-search algorithm might follow: Path I (0-a-b) and Path II (0-c-d-e) 
start at the same initial point, but their orders of search are reversed. 
Path III (O'-f) starts with a different initial solution. Note that the three 
paths terminate at three different locations (b, e, and f), none of which 
is the minimal solution. In this example, the algorithm stops whenever 
it reaches a point x for which Ci(x) = C%(x) and at which no further 
decrease in the function C(x) can be achieved by changing only one 
coordinate at a time. 



3.3 A feasible search direction 

The principle of the algorithm presented in this paper is to perform 
a sequence of searches through %, in "feasible directions of descent." 
A feasible direction of descent is the direction of any vector y(x) with 
the property that if x E %, there exists some \ > such that x + Xy E % 
and C(x + Xy) < C(x). Whenever such a direction exists, a step size for 
the search is chosen to maximize the decrease of the multihour cost 
function in that direction, while maintaining the feasibility of the solu- 
tion. 

In order to determine a direction of descent, we use the concept of the 
one-sided directional derivative of C(x) with respect to a vector 
y E y(x), where V(x) = |yG R n : for x E % and for some X > 0, x + 
Xy E %}• This derivative is denoted by C'(x; y) and is defined as fol- 
lows: 

c , (x . y)A ^cu±M^CM (17) 

AjO X 

C'(x; y) is thus the rate of change of the function C(x) in the direction 
of y, multiplied by \\y II, where ||-|| is the Euclidean norm. If C(x) is con- 
vex, C'{x;y) exists and is a convex function of y for every x at which C 
is finite. If C(x) is actually differentiable at x, then 

C'(x;y)=(y,VC(x)) (18) 

where V is the gradient operator and <•, •> denotes the scalar (or inner) 
product of two vectors. 4 

Substituting eq. (15) into the definition of the directional derivative 
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Fig. 3 — Typical paths for a coordinate-search algorithm. 



max CAx + Xy) — C(x) 



C'(x;y) = lim 

xjo 



nGM 



(19) 



Let 3{x) c M be the set of indices of those elementary cost functions 
which are maximal at x: 

3{x) ± | M : C„(x) = C(x)}. (20) 

For each n €: M, C„(x) is continuous for all x £ %• Consequently, for 
each x £ % and for each y £ 2/(x), there exists a X' > such that for 
all X with < X < X', 



C(x + Xy) = max C^x + Xy). 



(21) 



In words, there exists a neighborhood of x in which no elementary cost 
function can be maximal which is not also maximal at x. Therefore, the 
maximization over \l £ M in eq. (19) can be replaced by a maximization 
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over M G .7 = •?(*): 



max C„(x + Xy) - C(x) 

C'(x;y) = lim ^^ . (22) 

xio A 



Since C M (x) = C(x) for each n G3, 

ru ^ v [ C u (x + Xy)-C.{x) -\ 

C'{x;y) = lim max — e e — 

xio /z e ^L a J 

.. r CgU + Ay)-C^(x) 1 

= max lim — B e — 

^ e 'j xio L A J 

= max C;(x;y), (23) 

where C^(x; y) is the directional derivative of C M (x) with respect to y. 
(The order of the lim and max operators can be interchanged since 3 is 
finite and C^ is continuous for each /* G 3.) Since C^(x) is differentiable 
for each n G Mj 

C'(x;y)= max <y, VC„(*)>. (24) 

Thus, the rate of change of C(x) in the direction of y is negative (i.e., the 
direction of y is a feasible direction of descent) if and only if 

max<y, VC M (x)><0, y <EV{x) (25) 

n(EJ 

or, equivalently, 

(y, VC„(x)> < 0, for all /* G 3(x), y G V{x). (26) 

A point at which no feasible direction of descent for C(x) exists must 
be the location of the minimum of C(x). In fact, a convex function C(x) 
defined over a convex domain attains its global minimum at x = x* if 
and only if C(x) is finite and 

C'(x*;y)>0 for ally £V{x*). (27) 

Furthermore, x* is unique if C(x) is strictly convex. 4 Since G 2/(x), 
eq. (27) is equivalent to 

min C'(x*;y) = 0, (28) 

y G V(x*) 



t If x is on the boundary of X, VC^U) is defined as the limit of all sequences VC„[x (1) ], 
VC M [i (2 »], . . . , such that*' 1 ' e % and x<<> -► x. 
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or, with eq. (24) substituted, 



min max <y, VC M (x*)> = 0. (29) 

E 3/(x*) n € 3(x*) 



3.4 The descent algorithm 

Inequality (25) gives the condition for y to be in a feasible direction 
of descent for C(x). Such a y is generally not unique, and it is necessary 
to select a particular direction of descent at each iteration of the algo- 
rithm. A logical choice is the direction of steepest descent for C(x), i.e., 
the vector y* such that 

C'(x;y*) = min C'(x;y) (30) 

y g «p n v 

where <£ is the unit sphere in R n : 

S = \y:\\y\\<i\. (3D 

A solution for y* can be obtained in explicit form, as shown in the Ap- 
pendix, provided 3{x) contains either one or two elements, and V{x) = 
R n (i.e., x is not on the boundary of %). 

While it is possible, at least in principle, to solve for the steepest- 
descent vector in the general case (see the Appendix), the computation 
is cumbersome for three or more elements in 3{x), or if boundary con- 
straints are active. In this case it is more practical to choose a feasible 
search direction based on computational simplicity. For example, if S 
is chosen to be the set 

<£ = |y:|y,| <1, i - 1, . . . ,n\, (32) 

the min-max problem expressed by eq. (30) can be converted into a linear 
program: 

min o 

subject to 

(y, VC M (x)> < a for all n G 3(x) 

|y,| < 1, i=l,...,n 

y, > whenever x t = 0. (33) 

This linear-programming problem is solved by standard methods. Al- 
though the vector y * which solves that linear program may no longer be 
in the steepest-descent direction, the algorithm can still be shown to 
converge. 1 



* It can be shown that the algorithm will converge with y* chosen according to eq. (30) as 
long as £ is any convex, compact subset of ft" containing the origin in its interior. 5 
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In principle, the descent algorithm consists of alternately computing 
a search direction according to eq. (30) and performing a one-parameter 
search to locate the minimum of C(x) along that direction. (Each such 
combination is called an iteration.) While this procedure results in a 
sequence of feasible solutions with strictly decreasing costs, there is still 
no guarantee that this sequence will converge to the minimal solution. 
It is possible, as the result of a phenomenon known as "jamming" or 
"zig-zagging," for the sequence of solutions to converge to a point which 
does not satisfy eq. (29). 6 

The device used here to prevent jamming (and thus assure conver- 
gence to the minimal solution) is similar to that used by Zoutendijk. 6 
This device consists of expanding the set 3(x) to include all those ele- 
mentary cost functions which are "nearly" maximal at x. Let € > and 
define the new index set 

J ( (x) ± \n: C(x) - C,(x) < e}. (34) 

The direction-finding subproblem thus takes into account the directional 
derivatives of all those elementary cost functions which are within e of 
being maximal at x. Similarly, the feasibility conditions are modified 
to prevent the algorithm from attempting to reduce any further those 
trunk-group sizes which are already "nearly" equal to zero. To this end, 
let 8 > and define the set 

2/*(*) = |yGy(x): Ji ^ whenever x t < 8\. (35) 

For notational consistency, the original sets 3{x) and V(x) are hence- 
forth denoted by 3q{x) and Vq{x), respectively. 

The original problem of determining a search direction y* as expressed 
by eq. (30) is now replaced with the problem of finding y = y" which solves 
the min-max problem 

D (x ) = min max <y, VC M (x ) > . (36) 

y G <? n Vh m G 3< 

In this definition, the new symbol D(x) replaces the symbol C'(x; y), 
since the quantity which it denotes is no longer a directional derivative 
in the sense of eq. (17). Note, however, that if J ( (x) = J (x) and 3/ 5 (x) 
= y (x), then y = y* and D(x) = C'{x\ y*). 

Since the inclusion of any additional necessary conditions may overly 
constrain the direction-finding subproblem, the values of e and 8 are 
reduced adaptively throughout the progress of the algorithm, so that e 
— and 5 -*■ 0. The use of this procedure also serves a computational 
purpose, in that e and 8 can be viewed as the tolerances within which 
elementary cost functions are deemed maximal and within which 
trunk-group sizes are considered to be zero, respectively. 
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The descent algorithm is now specified as follows: 

Step 1 Let k be the iteration counter, and set k = 0. 

Select an arbitrary initial solution x (0) £ %. 

Select e and 5. 
Step 2 Compute C [*<*>]. 
Step 3 Determine the feasible search vector y (k) = y and compute 

D [*»>]. 
Step 4 If JJ[x<*>] < -€, go to Step 6. 

If -€ < D[x<*>] < 0, go to Step 5. 

IfD[x<*>] = 0,but.7 € [x<*>] * J [x< k) ] or 2/ 5 [x<*>] * 2/ [x<*>], 

go to Step 5. 

If D[x<*>] = 0and.7 { [x<*>] = S [x^] and2/ 6 [x<*>] = y [* w L 

stop. The solution x (A) = x* has been found. 
Step 5 Set e = c/2 and 5 = 6/2; go to Step 3. 
Step 6 Determine a scalar X (/z) such that 

C[x<*> + X<*y*>]= min C[x<*> + Xy<*>] 

x e a<*> 

where 

A<*> = IX: x<*> + Xy<*> G %}• 
Setx ( * +1) = x (A) + X (fe )y (fe) , setfe =fe + 1, and go to Step 2. 

The adaptive reduction of « and 5 is contained in Steps 4 and 5. 
Whenever |D[x (/e) ]| becomes sufficiently small (perhaps even zero), e 
and 5 are divided in half. If this reduction results in a decrease of the 
number of near-maximal elementary cost-functions or near-active 
boundary constraints, the direction-finding subproblem may be less 
constrained, and a new search direction may be found. If, on the other 
hand, the sets 3 ( {x) and 2/a(x) remain unaltered after c and b are divided 
in half, D[x (k) ] remains unchanged as well, and the algorithm proceeds 
directly to Step 4. 

It can be shown that this algorithm generates a sequence of solutions 
\x (k) ; k = 0, 1, 2, . . .} which is either finite, with its last term x* satisfy- 
ing 

C'(x*;y*) = 0, (37) 

or infinite, with any accumulation point x* satisfying eq. (37). 5t It was 
shown earlier, however, that eq. (37) is a necessary and sufficient con- 
dition for x* to be the unique, minimal noninteger solution to the mul- 
tihour engineering problem. 



t A practical stopping rule is suggested in Section 4.3. 
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IV. NUMERICAL RESULTS 

4.1 The model 

Three offices from the California network (Gardena, Compton, and 
Melrose) were engineered with the descent algorithm developed in 
Section III. For each office, the loads in two distinct hours (a morning 
hour and an evening hour) were considered. For the sake of simplicity, 
the loads Afh, A s h, and A,-/,, i — 1, . . . ,n (hereafter called "fixed loads," 
since they do not depend on the variables Xi) were assumed to be zero 
in both hours. All trunks were assumed to cost $1000, and the switching 
cost was $62/ccs. t The final and tandem-completing groups were as- 
sumed to have a common incremental capacity of 30 CCS/trunk. There 
were 37 high-usage groups in the Compton office, 43 in Gardena, and 35 
in Melrose. 

These three offices, together with the loads and costs, are the same 
as those used by M. Eisenberg; 1 they are used here again in order to allow 
direct comparisons with his results. While the simplifying assumptions 
in these cases certainly influence the numerical results, they preserve 
the essential properties of the multihour cost function and thus can be 
expected to demonstrate convergence characteristics similar to those 
which would occur in general. 

4.2 Implementation of the descent algorithm 

The assumption of zero fixed loads on the switch and on the tan- 
dem-completing groups allows the multihour cost function to be sim- 
plified considerably. Under this assumption, the busy hours on the 
tandem-completing groups are known a priori: the busy hour on the ith 
tandem-completing group is the same hour in which the load offered to 
the ith high-usage group is largest. Thus, only the final group and the 
switch have busy hours which may be functions of the high-usage group 
sizes. Furthermore, in the absence of fixed loads, the loads offered to the 
final group and to the switch are identical. Consequently, at most two 
of the elementary cost functions associated with each of these networks 
can ever be maximal. ' For the sake of notational simplicity, these two 
functions are denoted by C\(x) and C 2 (x), respectively. 

The feasible search vector for each iteration was chosen by specifying 
the set £ to be the unit sphere. For experimental purposes, two distinct 
initial feasible solutions were used for each of the three offices: 



+ 1 erlang = 36 CCS (hundred call-seconds per hour). 

* There are H n+2 elementary cost functions associated with a network with n high-usage 
groups which is engineered for H hours. Recent experience with more extensive data, in- 
cluding fixed loads, indicates that the busy hours of the tandem switch and of the tan- 
dem-completing groups are usually not affected by the sizes of the high-usage groups of 
the office which is being engineered. 7 Thus, one may need to consider only H elementary 
cost functions in a practical situation. 
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x f <°> = max cab i=l,...,n (38) 

hew 

and 

*.<o) = 0) i=l,...,n. (39) 

At each iteration, the optimal step size X (fe) was determined by a simple 
parameter search: First, the location of the minimum of C(x) along the 
search direction was bracketed between two points whose largest com- 
ponent-difference was 0.01 trunks. The minimal point was then com- 
puted more precisely, either by a quadratic approximation or by a linear 
interpolation, depending on whether J t (x) contained one or two elements 
at that point. The Erlang-B function for noninteger trunk-group sizes 
was evaluated by an approximation due to Rapp, 8 and its partial deriv- 
atives were approximated by central differences with a step size of 0.01. 
The initial values for e and 8 were set equal to 10 _3 C[x (0) ] and 0.1, re- 
spectively, and in all cases the algorithm was arbitrarily terminated after 
25 iterations. 

4.3 Convergence of the descent algorithm 

The behavior of the algorithm was similar for all three of the offices 
and for both starting points. For each office, the final solutions obtained 
with each of the two starting points differed by less than 0.003 trunks 
on any high-usage group. Figures 4 to 7 summarize the convergence 
characteristics of the algorithm for the Gardena office, with the starting 
point given by eq. (38). 

The cost of the network at each iteration, C[x (fe) ] (or simply C (A) ), as 
a function of the iteration number, k, is shown in Fig. 4. The cost de- 
creased monotonically with k, and the rate of change became very small 
after the first few iterations (e.g., C< 4) = 1.0003 C< 25 >). 

Figure 5 shows the magnitude of D[x {k) ] (or simply \D {k) \, and the 
value of €, as functions of k. As this figure indicates, |D (A) | , e, and 8 all 
approach zero as k -» °° (recall that e < |D (fe) | for all k > 0, and that 8 
~ (). Thus, it is evident that the algorithm was converging to the minimal 
solution when it was terminated. 

Unlike C (k) , \D {k) \ did not decrease monotonically. As the algorithm 
reduced t , there was an occasional iteration (k = 0, 2, and 6) at which the 
solution point x ik) lay outside the region for which | C[ k) — C^ k) | < e. As 
a result, ^ e [x (/j) ] contained only one element at these iterations, and y 
was given by -Vjmax [C[ k) , C? J ] }. (The dotted lines in Fig. 5 show the 
magnitudes of the gradients VCi and VC 2 as functions of k.) For the 
remaining iterations yjx (fe) ] contained both indices, so that>> and D {k) 
were computed via eqs. (61) to (63) in the Appendix. Note that since e 
-* as k -*■ °° and 3 ( [x (k) ] contained both elements for all k>l, the 
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Fig. 4 — Convergence of descent algorithm: network cost. 

solution point x* must be located on the intersection of the graphs of 
the two elementary cost functions. 

The relative change in the solution from one iteration to the next, as 
measured by the Euclidean "distance" ||x (A-1) — x (fe) ||, is shown in Fig. 
6. Note that this quantity also tended to zero as k increased, although 
not monotonically. In particular, this plot shows that for each iteration 
(except the first one) at which J ( [x (A) ] contained only one element, the 
corresponding step size was small. Thus, the algorithm generated a se- 
quence of solutions which tended to follow the intersection of the graphs 
of Ci(x) and C 2 (x) toward the minimal solution x*. Whenever the so- 
lution point moved too far from this intersection, a small step was taken 
to get back into the region defined by | C\{x) — C2C*) | ^ e, and the search 
along the intersection was resumed. 

Figure 6 also suggests how the norm of the change in trunk-group sizes 
can be used as a measure for a practical stopping rule. Let be a prese- 
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Fig. 5 — Convergence of descent algorithm: |D[x (fe) ]| and t. 

lected threshold. Then the algorithm is deemed to have converged, and 
is terminated, if k > 2 and \\x (k) — x ( * -1) || < for two consecutive iter- 
ations. The last solution x (/e) is then an approximation to the exact so- 
lution x*. 

Figure 7 shows how the solution point x (fe) converged. For this purpose, 
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Fig. 6 — Convergence of descent algorithm: change in trunk group sizes. 

the Euclidean distance between x (fe) and x {25) , ||jc </e> - x (25) ||, is plotted 
as a function of k. Note that x (fe) reached a small neighborhood of x (25) 
in relatively few iterations (e.g., within one trunk after only four itera- 
tions, and within 0.1 trunks after ten iterations). 

4.4 Further results 

Table I shows the offered loads for the Gardena office (in CCS), and 
the trunk-group sizes (optimal, and rounded to the nearest integer) 
obtained by the descent algorithm. For the purpose of comparison, the 
following other sets of trunk-group sizes are included: 

(i) For each high-usage group, the smallest and the largest number 
of trunks (in integers) selected from a set of solutions generated by the 
coordinate-search algorithm with 20 combinations of starting points and 
trunk-group orderings. 
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Fig. 7 — Convergence of descent algorithm: trunk group sizes. 

(U) The number of trunks in each high-usage group (again in integers) 
as determined by the so-called "cluster-busy-hour" method. 11. 

This table illustrates the trunk-group-size variability when a coordi- 
nate-search algorithm is used, relative to the optimal — and hence 
unique — solution. Note that some groups varied by as many as eight 
trunks, while only six groups showed no variability. In view of the 
practical problems associated with such an uncertainty, as discussed in 
the Introduction, the need for obtaining a unique solution is clear. 

Table II lists the costs' of the optimal solutions for Compton, Gardena, 



t Cluster-busy-hour engineering is a traditional method for sizing traffic networks, in 
which trunk-group sizes are chosen to minimize an elementary cost function whose al- 
ternate-route costs are all evaluated in a single, time-consistent hour. The hour selected 
is that hour in which the sum of the first-offered loads to all the high-usage groups which 
overflow to a common final group, plus the first-offered load to that final group, is larg- 
est. 

t All costs are computed via eq. (3), and thus include the marginal-capacity approxi- 
mation. The costs of comparable networks reported in Ref. 1 reflect the actual required 
sizes of the final groups as determined after the high-usage groups have been sized. 
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and Melrose. For the purpose of comparison, the table also lists the costs 
of the following other networks: 

(i) A network with the optimal trunk-group sizes rounded to the 
nearest integers. 

(ii) The two networks with the lowest and highest costs, respectively, 
selected from the set of 20 solutions generated by the coordinate-search 
algorithm. 

(Hi) The cluster-busy-hour network. 

These results show that while some combinations of starting points and 
trunk-group orderings for the coordinate-search algorithm yielded so- 
lutions whose costs were only a fraction of a percent higher than the 
optimum, other combinations led to substantially higher costs (up to 
5.5 percent in the case of Compton). 

Since a network is realizable only in integer trunk-group sizes, the 
optimal (noninteger) solution must be rounded in some way. As indicated 
by the results in Table II, rounding of optimal trunk-group sizes to the 
nearest integers is an attractive alternative: It is simple; it yields an es- 
sentially unique solution; and, although it generally does not lead to the 
optimal integer solution, it yields networks whose costs are only slightly 
higher (from 0.2 to 0.44 percent in the three cases examined) than those 
of the optimal, noninteger solutions. (Subsequent studies 10 have shown 
that, among several practical approaches, rounding the optimal solution 
to the nearest integers, or to the nearest multiples of the module size, 
is indeed the policy most likely to minimize cost.) 

The cluster-busy-hour networks are included to provide some per- 
spective. It is evident that while the cost of any solution obtained by the 
coordinate-search algorithm is substantially lower than the cost of the 
cluster-busy-hour solution, additional nonneligibile capital savings may 
be obtained by computing the optimal solution via the descent algo- 
rithm. 

A comparison of the rounded optimal solution with the cluster- 
busy-hour solution reveals that these two networks are not very different. 
(The average absolute difference in high-usage group sizes is only 0.8 
trunks, while the average group size for the rounded optimal solution 
is 7.1 trunks.) The cost of the cluster-busy-hour network, however, is 11.7 
percent higher than that of the integerized optimal solution. The sen- 
sitivity of the cost to relatively small changes in trunk-group sizes is a 
consequence of the "shape" of the multihour cost function. As Fig. 2 
suggests, the contours of this function are long and narrow, and the slope 
is steep in directions normal to the intersection between the two ele- 
mentary cost functions. 

In contrast, C(x) is much less sensitive to changes in x along the in- 
tersection of the graphs of C x (x) and C 2 (x). It is for this reason that the 
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Table I — High usage trunk group sizes — Gardena 
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Table II — Network costs 
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largest cost difference between the 20 sample solutions generated by the 
coordinate-search algorithm is only 4.5 percent. (As suggested by Fig. 
3, all termination points for the coordinate-search algorithm lie on this 
intersection in this particular example.) 

The high sensitivity of network cost to trunk variations in some di- 
rections is not, of course, a consequence of engineering a network to a 
multihour (minimum-cost) solution. The cost function C(x) as defined 
by eq. (10) represents the actual cost of the network. The multihour 
method is simply the one which recognizes this actual cost during the 
sizing process. 

The imposition of modular engineering rules tends to diminish the 
capital savings of multihour engineering over single-hour engineering, 
by blurring some of the fine structure of the networks. A substantial 
portion of the savings can still be realized, however, as long as the module 
size is not large relative to the average group size. For example, in the 
three networks studied here, where the average group size is 7.4, rounding 
the high-usage group sizes to the nearest multiples of six trunks resulted 
in a reduction of the original savings by approximately one-fourth. 



V. SUMMARY 

The cost of a traffic network which gives a minimum specified grade 
of service on the last-choice routes in more than one hour can be ap- 
proximated by a strictly convex, although possibly nondifferentiable, 
function of the high-usage trunk-group sizes. A descent algorithm, which 
can be shown to converge to the unique, noninteger minimum-cost 
network, has been developed. The noninteger solution is subsequently 
rounded to the nearest allowable integer (or modular) solution to yield 
a realizable network. The main advantage of this algorithm relative to 
the coordinate-search method is that the uniqueness of the solution 
prevents unnecessary, expensive rearrangements from being undertaken 
as traffic loads change with time. A secondary advantage is a small 
possible additional saving in network capital cost. 

The results obtained from applying the descent algorithm to three 
numerical examples (and to others not discussed here) demonstrate that 
even after only a few iterations a sufficiently high degree of precision can 
be achieved to ensure the reproducibility of the results and hence the 
stability which motivated the design of the algorithm. 
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APPENDIX 

The steepest-descent direction for the multihour cost function 

Consider the problem of finding a vector y* E R n with the property 
that 

C'{x,y*)= min C'(x;y) 

llyllsi 

= min max (y, VC M (x)> (40) 

||y||<i m e ^(x) 

where C'(- ; •), C M (x) and 3{x) have all been defined in Section III. 
Without loss of generality, let J(x) ■{!,... ,m\ and let Z be the convex 
hull of the set of all the inner products (y, VC/(x)>, ; = !,... ,m: 



Z = 



*:* = £ \,<y,VC,(x)>, £a ; =1, X,-i0 



(41) 



In other words, Z is the shortest closed segment of the real line which 
contains all the inner products <y, VCj(x)),j = 1, . . . ,m. Consequently, 
the maximal inner product is also the maximal element in Z: 

max (y,VCj{x)) = max z. (42) 

j e 3(x) 2 g z 

Since the inner product is a linear operator, the elements z defined by 
eq. (41) can be rewritten as 



/ m \ m 

= (y, LW(x)),E Ay-i, x,> 
\ i-i / ;=i 



0. (43) 



Now define a new set, £, as the convex hull of the gradients of the ele- 
mentary cost functions: 



(m m \ 



(44) 



The relationship between the sets and Z is evidently 

Z = \z:z= (y,g),geS\ (45) 

and thus 

max = max (y,g) (46) 

z g z g e 

Combining eqs. (40), (42), and (46) yields 

C'{x;y*)= min max(y,g). (47) 

h\\<igGS 
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Since the two constraint sets on the right-hand side of eq. (47) are convex 
and compact, the minimization and maximization operations can be 
interchanged: 9 

C'{x\y*) = max min (y,g). (48) 

g^g \\y\\<l 

For any g G S with g^O, 

= -\\g\l (49) 



mm 

llyl 



IfS = 0,then 



min (y,g) = 0. (50) 

lly||<l 



Equation (48) is then equivalent to 



C'(x;y*) = max (-Is I) 
ges 

= -minb II (51) 

ge.9 

Lets* be the vector with minimum norm in 9, i.e., 

||s*| = min ||s ||. (52) 

ges 

The desired result is now given by 

C / (x;y*) = -||s*ll (53) 

'0 ,s* = o 
y* = t s* (54) 

lis* II 

The set S and its elements g are called the "subdifferential" and the 
"subgradients," respectively, of the convex function C(x). 4 The steep- 
est-descent vector y * is then in the negative direction of the minimum- 
norm subgradient of C(x). Note, incidentally, that C'(x*\ y*) = — the 
necessary and sufficient condition for C(x*) to be the minimum — is 
equivalent to the condition that G 8 at x*. 

Explicit solutions for y * can now be found for the cases m = 1 and m 
= 2, as follows: 

(i) m = 1: 
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In this case we simply have 

g = VC l (x) = VC(x)=g*. (55) 

(ii) m = 2: 

The subdifferential is now given by 

9 = \g: g = 0VCi(x) + (1 - 0)VC 2 (x), < < 1}. (56) 

If £ S, then y * = g* = 0. Suppose now that (j£ £. The unconstrained 
minimum of \\g\\ is found by setting its derivative (with respect to fi) 
equal to zero: 

J- o \\g\\=0. (57) 

dp 



Since 

the relationship 



Ikll 2 =<*,*>, (58) 



2\\g\\-^\\g\\=-^(g,g) (59) 

is obtained. Thus, since ||g|| j± 0, eq. (57) is equivalent to 

j-Ag,g)=0. (60) 

dp 

Let = ft satisfy eq. (60). Expanding the inner product, taking the de- 
rivative, and solving for j8 yields 

a = l|VC 2 || - (VC b VC 2 ) 

l|VC 1 || 2 + ||VC 2 || 2 - 2<VC 1 , VC 2 > " 

However, since /S is constrained by < < 1, the minimum-norm 
subgradient g* is given by 

£* = 0*VCi+(l-0*)VC 2 (62) 

where 



r = i°; p<0 h (63) 

*% O<0<1 

1, 0>1 
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